3 <110> APPLICANT: Bogosian, Gregg 

4 O'Neill, Julia P. 

5 Smith, Hong Q. 

7 <120> TITLE OF INVENTION: Prevention of Incorporation of Non-Standard Amino Acids into 

8 Protein 
10 <130> FILE REFERENCE: 11916.0059.PCUS01 

C--> 12 <140> CURRENT APPLICATION NUMBER: US/10/572,711 
C--> 12 <141> CURRENT FILING DATE: 2006-03-20 

12 <150> PRIOR APPLICATION NUMBER: PCT/US 2004/031224 

13 <151> PRIOR FILING DATE: 2004-09-23 

15 <150> PRIOR APPLICATION NUMBER: US 60/505,807 

16 <151> PRIOR FILING DATE: 2003-09-25 
18 <160> NUMBER OF SEQ ID NOS: 16 
20 <170> SOFTWARE: PatentIn version 3.3 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 1344 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Escherichia coli 

27 <400> SEQUENCE: 1 

28 atggatcaga catattctct ggagtcattc ctcaaccatg tccaaaagcg cgacccgaat 60 
30 caaaccgagt tcgcgcaagc cgttcgtgaa gtaatgacca cactctggcc ttttcttgaa 120 
32 caaaatccaa aatatcgcca gatgtcatta ctggagcgtc tggttgaacc ggagcgcgtg 180 
34 atccagtttc gcgtggtatg ggttgatgat cgcaaccaga tacaggtcaa ccgtgcatgg 240 
36 cgtgtgcagt tcagctctgc catcggcccg tacaaaggcg gtatgcgctt ccatccgtca 300 
38 gttaaccttt ccattctcaa attcctcggc tttgaacaaa ccttcaaaaa tgccctgact 360 
40 actctgccga tgggcggtgg taaaggcggc agcgatttcg atccgaaagg aaaaagcgaa 420 
42 ggtgaagtga tgcgtttttg ccaggcgctg atgactgaac tgtatcgcca cctgggcgcg 480 
44 gataccgacg ttccggcagg tgatatcggg gttggtggtc gtgaagtcgg ctttatggcg 540 
46 gggatgatga aaaagctctc caacaatacc gcctgcgtct tcaccggtaa gggcctttca 600 
48 tttggcggca gtcttattcg cccggaagct accggctacg gtctggttta tttcacagaa 660 
50 gcaatgctaa aacgccacgg tatgggtttt gaagggatgc gcgtttccgt ttctggctcc 720 
52 ggcaacgtcg cccagtacgc tatcgaaaaa gcgatggaat ttggtgctcg tgtgatcact 780 
54 gcgtcagact ccagcggcac tgtagttgat gaaagcggat tcacgaaaga gaaactggca 840 
56 cgtcttatcg aaatcaaagc cagccgcgat ggtcgagtgg cagattacgc caaagaattt 900 
58 ggtctggtct atctcgaagg ccaacagccg tggtctctac cggttgatat cgccctgcct 960 
60 tgcgccaccc agaatgaact ggatgttgac gccgcgcatc agcttatcgc taatggcgtt 1020 
62 aaagccgtcg ccgaaggggc aaatatgccg accaccatcg aagcgactga actgttccag 1080 
64 caggcaggcg tactatttgc accgggtaaa gcggctaatg ctggtggcgt cgctacatcg 1140 
66 ggcctggaaa tggcacaaaa cgctgcgcgc ctgggctgga aagccgagaa agttgacgca 1200 
68 cgtttgcatc acatcatgct ggatatccac catgcctgtg ttgagcatgg tggtgaaggt 1260 
70 gagcaaacca actacgtgca gggcgcgaac attgccggtt ttgtgaaggt tgccgatgcg 1320 
72 atgctggcgc agggtgtgat ttaa 1344 

75 <210> SEQ ID NO: 2 

76 <211> LENGTH: 447 

77 <212> TYPE: PRT 

78 <213> ORGANISM: Escherichia coli 

80 <400> SEQUENCE: 2 

82 Met Asp Gln Thr Tyr Ser Leu Glu Ser Phe Leu Asn His Val Gln Lys 
83 1 5 10 15 

86 Arg Asp Pro Asn Gln Thr Glu Phe Ala Gln Ala Val Arg Glu Val Met 
87 20 25 30 

90 Thr Thr Leu Trp Pro Phe Leu Glu Gln Asn Pro Lys Tyr Arg Gln Met 
91 35 40 45 

94 Ser Leu Leu Glu Arg Leu Val Glu Pro Glu Arg Val Ile Gln Phe Arg 
95 50 55 60 

98 Val Val Trp Val Asp Asp Arg Asn Gln Ile Gln Val Asn Arg Ala Trp 
99 65 70 75 80 

102 Arg Val Gln Phe Ser Ser Ala Ile Gly Pro Tyr Lys Gly Gly Met Arg 
103 85 90 95 

106 Phe His Pro Ser Val Asn Leu Ser Ile Leu Lys Phe Leu Gly Phe Glu 
107 100 105 110 

110 Gln Thr Phe Lys Asn Ala Leu Thr Thr Leu Pro Met Gly Gly Gly Lys 
111 115 120 125 

114 Gly Gly Ser Asp Phe Asp Pro Lys Gly Lys Ser Glu Gly Glu Val Met 
115 130 135 140 

118 Arg Phe Cys Gln Ala Leu Met Thr Glu Leu Tyr Arg His Leu Gly Ala 
119 145 150 155 160 

122 Asp Thr Asp Val Pro Ala Gly Asp Ile Gly Val Gly Gly Arg Glu Val 
123 165 170 175 

126 Gly Phe Met Ala Gly Met Met Lys Lys Leu Ser Asn Asn Thr Ala Cys 
127 180 185 190 

130 Val Phe Thr Gly Lys Gly Leu Ser Phe Gly Gly Ser Leu Ile Arg Pro 
131 195 200 205 

134 Glu Ala Thr Gly Tyr Gly Leu Val Tyr Phe Thr Glu Ala Met Leu Lys 
135 210 215 220 

138 Arg His Gly Met Gly Phe Glu Gly Met Arg Val Ser Val Ser Gly Ser 
139 225 230 235 240 

142 Gly Asn Val Ala Gln Tyr Ala Ile Glu Lys Ala Met Glu Phe Gly Ala 
143 245 250 255 

146 Arg Val Ile Thr Ala Ser Asp Ser Ser Gly Thr Val Val Asp Glu Ser 
147 260 265 270 

150 Gly Phe Thr Lys Glu Lys Leu Ala Arg Leu Ile Glu Ile Lys Ala Ser 
151 275 280 285 

154 Arg Asp Gly Arg Val Ala Asp Tyr Ala Lys Glu Phe Gly Leu Val Tyr 
155 290 295 300 

158 Leu Glu Gly Gln Gln Pro Trp Ser Leu Pro Val Asp Ile Ala Leu Pro 
159 305 310 315 320 

162 Cys Ala Thr Gln Asn Glu Leu Asp Val Asp Ala Ala His Gln Leu Ile 
163 325 330 335 

166 Ala Asn Gly Val Lys Ala Val Ala Glu Gly Ala Asn Met Pro Thr Thr 
167 340 345 350 

170 Ile Glu Ala Thr Glu Leu Phe Gln Gln Ala Gly Val Leu Phe Ala Pro 
171 355 360 365 

174 Gly Lys Ala Ala Asn Ala Gly Gly Val Ala Thr Ser Gly Leu Glu Met 
175 370 375 380 

178 Ala Gln Asn Ala Ala Arg Leu Gly Trp Lys Ala Glu Lys Val Asp Ala 
179 385 390 395 400 

182 Arg Leu His His Ile Met Leu Asp Ile His His Ala Cys Val Glu His 
183 405 410 415 

186 Gly Gly Glu Gly Glu Gln Thr Asn Tyr Val Gln Gly Ala Asn Ile Ala 
187 420 425 430 

190 Gly Phe Val Lys Val Ala Asp Ala Met Leu Ala Gln Gly Val Ile 
191 435 440 445 

194 <210> SEQ ID NO: 3 

195 <211> LENGTH: 1344 

196 <212> TYPE: DNA 

197 <213> ORGANISM: Escherichia coli 

199 <400> SEQUENCE: 3 

200 atggatcaga catattctct ggagtcattc ctcaaccatg tccaaaagcg cgacccgaat 60 
202 caaaccgagt tcgcgcaagc cgttcgtgaa gtaatgacca cactctggcc ttttcttgaa 120 
204 caaaatccaa aatatcgcca gatgtcatta ctggagcgtc tggttgaacc ggagcgcgtg 180 
206 atccagtttc gcgtggtatg ggttgatgat cgcaaccaga tacaggtcaa ccgtgcatgg 240 
208 cgtgtgcagt tcagctctgc catcggcccg tacctgggcg gtatgcgctt ccatccgtca 300 
210 gttaaccttt ccattctcaa attcctcggc tttgaacaaa ccttcaaaaa tgccctgact 360 
212 actctgccga tgggcggtgg taaaggcggc agcgatttcg atccgaaagg aaaaagcgaa 420 
214 ggtgaagtga tgcgtttttg ccaggcgctg atgactgaac tgtatcgcca cctgggcgcg 480 
216 gataccgacg ttccggcagg tgatatcggg gttggtggtc gtgaagtcgg ctttatggcg 540 
218 gggatgatga aaaagctctc caacaatacc gcctgcgtct tcaccggtaa gggcctttca 600 
220 tttggcggca gtcttattcg cccggaagct accggctacg gtctggttta tttcacagaa 660 
222 gcaatgctaa aacgccacgg tatgggtttt gaagggatgc gcgtttccgt ttctggctcc 720 
224 ggcaacgtcg cccagtacgc tatcgaaaaa gcgatggaat ttggtgctcg tgtgatcact 780 
226 gcgtcagact ccagcggcac tgtagttgat gaaagcggat tcacgaaaga gaaactggca 840 
228 cgtcttatcg aaatcaaagc cagccgcgat ggtcgagtgg cagattacgc caaagaattt 900 
230 ggtctggtct atctcgaagg ccaacagccg tggtctctac cggttgatat cgccctgcct 960 
232 tgcgccaccc agaatgaact ggatgttgac gccgcgcatc agcttatcgc taatggcgtt 1020 
234 aaagccgtcg ccgaaggggc aaatatgccg accaccatcg aagcgactga actgttccag 1080 
236 caggcaggcg tactatttgc accgggtaaa gcggctaatg ctggtggcgt cgctacatcg 1140 
238 ggcctggaaa tggcacaaaa cgctgcgcgc ctgggctgga aagccgagaa agttgacgca 1200 
240 cgtttgcatc acatcatgct ggatatccac catgcctgtg ttgagcatgg tggtgaaggt 1260 
242 gagcaaacca actacgtgca gggcgcgaac attgccggtt ttgtgaaggt tgccgatgcg 1320 
244 atgctggcgc agggtgtgat ttaa 1344 

247 <210> SEQ ID NO: 4 

248 <211> LENGTH: 447 

249 <212> TYPE: PRT 

250 <213> ORGANISM: Escherichia coli 
252 <400> SEQUENCE: 4 

254 Met Asp Gln Thr Tyr Ser Leu Glu Ser Phe Leu Asn His Val Gln Lys 
255 1 5 10 15 

258 Arg Asp Pro Asn Gln Thr Glu Phe Ala Gln Ala Val Arg Glu Val Met 
259 20 25 30 

262 Thr Thr Leu Trp Pro Phe Leu Glu Gln Asn Pro Lys Tyr Arg Gln Met 
263 35 40 45 

266 Ser Leu Leu Glu Arg Leu Val Glu Pro Glu Arg Val Ile Gln Phe Arg 
267 50 55 60 

270 Val Val Trp Val Asp Asp Arg Asn Gln Ile Gln Val Asn Arg Ala Trp 
271 65 70 75 80 

274 Arg Val Gln Phe Ser Ser Ala Ile Gly Pro Tyr Leu Gly Gly Met Arg 
275 85 90 95 

278 Phe His Pro Ser Val Asn Leu Ser Ile Leu Lys Phe Leu Gly Phe Glu 
279 100 105 110 

282 Gln Thr Phe Lys Asn Ala Leu Thr Thr Leu Pro Met Gly Gly Gly Lys 
283 115 120 125 

286 Gly Gly Ser Asp Phe Asp Pro Lys Gly Lys Ser Glu Gly Glu Val Met 
287 130 135 140 

290 Arg Phe Cys Gln Ala Leu Met Thr Glu Leu Tyr Arg His Leu Gly Ala 
291 145 150 155 160 

294 Asp Thr Asp Val Pro Ala Gly Asp Ile Gly Val Gly Gly Arg Glu Val 
295 165 170 175 

298 Gly Phe Met Ala Gly Met Met Lys Lys Leu Ser Asn Asn Thr Ala Cys 
299 180 185 190 

302 Val Phe Thr Gly Lys Gly Leu Ser Phe Gly Gly Ser Leu Ile Arg Pro 
303 195 200 205 

306 Glu Ala Thr Gly Tyr Gly Leu Val Tyr Phe Thr Glu Ala Met Leu Lys 
307 210 215 220 

310 Arg His Gly Met Gly Phe Glu Gly Met Arg Val Ser Val Ser Gly Ser 
311 225 230 235 240 

314 Gly Asn Val Ala Gln Tyr Ala Ile Glu Lys Ala Met Glu Phe Gly Ala 
315 245 250 255 

318 Arg Val Ile Thr Ala Ser Asp Ser Ser Gly Thr Val Val Asp Glu Ser 
319 260 265 270 

322 Gly Phe Thr Lys Glu Lys Leu Ala Arg Leu Ile Glu Ile Lys Ala Ser 
323 275 280 285 

326 Arg Asp Gly Arg Val Ala Asp Tyr Ala Lys Glu Phe Gly Leu Val Tyr 
327 290 295 300 

330 Leu Glu Gly Gln Gln Pro Trp Ser Leu Pro Val Asp Ile Ala Leu Pro 
331 305 310 315 320 

334 Cys Ala Thr Gln Asn Glu Leu Asp Val Asp Ala Ala His Gln Leu Ile 
335 325 330 335 

338 Ala Asn Gly Val Lys Ala Val Ala Glu Gly Ala Asn Met Pro Thr Thr 
339 340 345 350 

342 Ile Glu Ala Thr Glu Leu Phe Gln Gln Ala Gly Val Leu Phe Ala Pro 
343 355 360 365 

346 Gly Lys Ala Ala Asn Ala Gly Gly Val Ala Thr Ser Gly Leu Glu Met 
347 370 375 380 

350 Ala Gln Asn Ala Ala Arg Leu Gly Trp Lys Ala Glu Lys Val Asp Ala 
351 385 390 395 400 

354 Arg Leu His His Ile Met Leu Asp Ile His His Ala Cys Val Glu His 
355 405 410 415 

358 Gly Gly Glu Gly Glu Gln Thr Asn Tyr Val Gln Gly Ala Asn Ile Ala 
359 420 425 430 

362 Gly Phe Val Lys Val Ala Asp Ala Met Leu Ala Gln Gly Val Ile 
363 435 440 445 

366 <210> SEQ ID NO: 5 

367 <211> LENGTH: 1101 

368 <212> TYPE: DNA 

369 <213> ORGANISM: Bacillus cereus 

371 <400> SEQUENCE: 5 

372 atgacattag aaatcttcga atacttagaa aaatatgatt atgagcaagt agtattttgt 60 
374 caagataaag aatctggttt aaaagcaatt attgcaattc atgatacaac acttggaccg 120 
376 gctcttggtg gaacaagaat gtggacatat gattctgaag aagcggcgat tgaagatgca 180 
378 ttgcgtcttg caaaagggat gacatacaaa aacgcagcag ctggtttaaa cttaggtggt 240 
380 gcgaaaacag taattatcgg tgatcctcgt aaagataaga gcgaagcaat gttccgtgca 300 
382 ctaggacgtt atatccaagg actaaacgga cgttacatta cagctgaaga tgttggtaca 360 
384 acagtagatg atatggatat tatccatgaa gaaactgact ttgtaacagg tatctcacca 420 
386 tcattcggtt cttctggtaa cccatctccg gtaactgcat acggtgttta ccgtggtatg 480 
388 aaagcagctg caaaagaagc tttcggtact gacaatttag aaggaaaagt aattgctgtt 540 
390 caaggcgttg gtaacgtagc atatcaccta tgcaaacatt tacacgctga aggagcaaaa 600 
392 ttaatcgtta cagatattaa taaagaagct gtacaacgtg ctgtagaaga attcggtgca 660 
394 tcagcagttg aaccaaatga aatttacggt gttgaatgcg atatttacgc accatgtgca 720 
396 ctaggcgcaa cagttaatga tgaaactatt ccacaactta aagcaaaagt aatcgcaggt 780 
398 tctgcaaata accaattaaa agaagatcgt catggtgaca tcattcatga aatgggtatt 840 
400 gtatacgcac cagattatgt aattaatgca ggtggcgtaa ttaacgtagc agacgaatta 900 
402 tatggataca atagagaacg tgcactaaaa cgtgttgagt ctatttatga cacgattgca 960 
404 aaagtaatcg aaatttcaaa acgcgatggc atagcaactt atgtagcggc agatcgtcta 1020 
406 gctgaagagc gcattgcaag cttgaagaat tctcgtagca cttacttacg caacggtcac 1080 
408 gatattatta gccgtcgcta a 1101 

411 <210> SEQ ID NO: 6 

412 <211> LENGTH: 366 

413 <212> TYPE: PRT 

414 <213> ORGANISM: Bacillus cereus 

416 <400> SEQUENCE: 6 

418 Met Thr Leu Glu Ile Phe Glu Tyr Leu Glu Lys Tyr Asp Tyr Glu Gln 
419 1 5 10 15 

422 Val Val Phe Cys Gln Asp Lys Glu Ser Gly Leu Lys Ala Ile Ile Ala 
423 20 25 30 

426 Ile His Asp Thr Thr Leu Gly Pro Ala Leu Gly Gly Thr Arg Met Trp 
427 35 40 45 

430 Thr Tyr Asp Ser Glu Glu Ala Ala Ile Glu Asp Ala Leu Arg Leu Ala 
431 50 55 60 

434 Lys Gly Met Thr Tyr Lys Asn Ala Ala Ala Gly Leu Asn Leu Gly Gly 
435 65 70 75 80 

438 Ala Lys Thr Val Ile Ile Gly Asp Pro Arg Lys Asp Lys Ser Glu Ala 
439 85 90 95 

442 Met Phe Arg Ala Leu Gly Arg Tyr Ile Gln Gly Leu Asn Gly Arg Tyr 
443 100 105 110 

446 Ile Thr Ala Glu Asp Val Gly Thr Thr Val Asp Asp Met Asp Ile Ile 
447 115 120 125 

450 His Glu Glu Thr Asp Phe Val Thr Gly Ile Ser Pro Ser Phe Gly Ser 
451 130 135 140 

Input Set : A:\11916.0059.PCUS01.ST25.txt 
Output Set: N:\CRF4\03292006\J572711.raw 

L:12 M:270 C: Current Application Number differs. Replaced Current Application No 
L:12 M:271 C: Current Filing Date differs. Replaced Current Filing Date 